Development of a TV Broadcasts Speech Recognition System for Qatari Arabic
نویسندگان
چکیده
A major problem with dialectal Arabic speech recognition is due to the sparsity of speech resources. In this paper, a transfer learning framework is proposed to jointly use a large amount of Modern Standard Arabic (MSA) data and little amount of dialectal Arabic data to improve acoustic and language modeling. The Qatari Arabic (QA) dialect has been chosen as a typical example for an under-resourced Arabic dialect. A wide-band speech corpus has been collected and transcribed from several Qatari TV series and talk-show programs. A large vocabulary speech recognition baseline system was built using the QA corpus. The proposed MSA-based transfer learning technique was performed by applying orthographic normalization, phone mapping, data pooling, acoustic model adaptation, and system combination. The proposed approach can achieve more than 28% relative reduction in WER.
منابع مشابه
A Transfer Learning Approach for Under-Resourced Arabic Dialects Speech Recognition
A major problem with dialectal Arabic speech recognition is due to the sparsity of speech resources. In this paper, we propose a transfer learning framework to jointly use large amount of Modern Standard Arabic (MSA) data and little amount of dialectal Arabic data to improve acoustic and language modeling. We have chosen the Qatari Arabic (QA) dialect as a typical example for an under-resourced...
متن کاملOff-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model
In order to facilitate the entry of data into the computer and its digitalization, automatic recognition of printed texts and manuscripts is one of the considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...
متن کاملMedia monitoring system for latvian radio and TV broadcasts
Media monitoring allows to capture media exposure of people, organizations and other important topics. This paper presents a media monitoring system for Latvian radio and television broadcasts. This system uses an automatic speech recognition (ASR) module to convert audio and video files to text and to extract keywords of interest. The system has been developed in close cooperation with Latvian...
متن کاملIntelligent Remote Control for TV Program based on Emotion in Arabic Speech
Recommender systems for TV program have been studied for the realization of personalized TV Electronic Program Guides. In this paper, we propose automatic emotion Arabic speech recognition in order to achieve an intelligent remote control. In addition, the TV can estimate our interests and preferences by observing our behavior to watch and have a conversation on topics that might be interesting...
متن کاملSpeech Recognition for Subtitling Japanese Live Broadcasts
There is a great need for more TV programs to be subtitled to help hearing impaired and elderly people to watch TV. NHK has researched automatic speech recognition for subtitling live TV programs in real time efficiently. Our speech recognition system learns frequent words and expressions expected in the program beforehand and also learns characteristics of announcers’ voices in order to reduce...
متن کامل